CDS

Accession Number TCMCG019C29075
gbkey CDS
Protein Id XP_022960744.1
Location join(84672..84770,84847..84916,85094..85191,85664..85805,85893..85993,86841..86924,87078..87128,87228..87293,87367..87432,87527..87607,87710..87787,87969..88136,88636..88714,88950..89068,89297..89668)
Gene LOC111461453
GeneID 111461453
Organism Cucurbita moschata

Protein

Length 557aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023104976.1
Definition putative clathrin assembly protein At5g35200 isoform X2 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
ko:K20044        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005886        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0071944        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTCAGGTGGGGGTACACAGAACAGCTTAAGAAAAGCACTGGGGGCCCTGAAGGATACTACCACAGTTTCATTAGCTAAAGTTAACAGTGATTATAAGGAATTAGACATTGCTATAGTGAAGGCAACAAATCATGTTGAACGTCCTGCAAAAGAAAAACATATCCGAGCTATATTCGCGGCTATCTCAGCTACCAGGCCTAGGGCTGATGTTGCATATTGCATTCATGCTTTGGCAAGAAGATTATCAAAGACGCATAATTGGGCAGTTGCATTAAAGACTTTGGTTGTTATCCATCGTGCTTTGCGGGAAGTAGACCCCACATTTCACGAAGAACTCATTAACTATGGCAGGAGAAGAAACCACATGCTTAATTTATCTCATTTTAAGGACGATTCCAGTGCTAATGCATGGGATTATTCTGCTTGGGTTCGTTCATATGCCTTATTTTTGGAGGAGAGGTTGGAATGTTTCCGTGTGCTGAAGTATGACGTTGAGACAGATCGTGCGAGAACCAAAGATCTAGATACTGCTGAGTTGCTTGAGCAGTTGCCAGCATTACAAGAGCTTCTATATCGGGTACTTGGTTGTCAGCCTCAAGGAGCCGCCGTTCATAATTTTGTAATTCAGCTAGCCCTTTCATTGGTTGCTTCTGAGAGCATCAAAATTTATCAAGCCATCAGTGATGGTACGGTAAACTTAGTTGACAAGTTTTTCGAGATGCAACGGCAAGATGCAATGAAAGCCCTGGAAATTTACAGGCGGGCTGGCCAGCAGGCGGAGAGGCTCTCTGAGTTCTATGAAGTTTGTAAAAATCTTGATATTGGGCGTGGCCAGACATTTACAAAGATTGAACAGCCCCCTGCATCATTTTTACAAGCCATGGAAGAATATGTAAGAGAAGCTCCACGGACTTCAACCATTCGTAAGGATCAGGTTGCTGATGCTAAACTGGCTGCTCCTAAAGACATTTTGGCCATCGAGTACAAGAAGGAACCGGCAGCGCAAGTTGAACAGCCAGTGGCACCTCCACCAGCCCCGTCTCCCCCACCACCTGAACCAGTTAAAGTAGAACCAGCCGTGACTGAGCCACCTGACTTGTTGGGTTTGAATGATCCTGTACCTGAGGCTACTTCCAATTTGGATGAAAAGAATTCTCTGGCGTTGGCTATTGTCCCAGATGCCGATCAAAAAACCAGTTCTGCTCCAAGCCAAGTTAATGGTACTACAACTACCGGCTGGGAATTGGCACTTGTTACGGCACCAAGCTCAAATGAAAATGTAGCTGCTACAAGCAAATTGGCCGGAGGTTTGGACTTGCTTACATTAGACAGCTTGTATGATGATGCAATCAGAAGAAATAATCAGAACGTGAGTTACAATCCATGGGAGCCAGTCCCAGTGCATGGTGCCATGGTGCAACAACAGCCAATCCATGATCCCTTTTTTGCCTCGTCTGCTGTGGCTGCACCTCATTCAGTACAAATGTCAGCTATGGCCAACCAGCAGCAAGCTTTCATGTTGCATCAGCACCAGCATCAGCAACAGATGATGATGATGGCTCCCCCACCGCAACAGTCGAATCCTTTCGGAAATCCTCATGGAACCAATGGCCACCACTACGGTCCGGGTATGCCTGTTCATGCTTCCAATCCATATGCTGGTCTCATTTAA
Protein:  
MSGGGTQNSLRKALGALKDTTTVSLAKVNSDYKELDIAIVKATNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESIKIYQAISDGTVNLVDKFFEMQRQDAMKALEIYRRAGQQAERLSEFYEVCKNLDIGRGQTFTKIEQPPASFLQAMEEYVREAPRTSTIRKDQVADAKLAAPKDILAIEYKKEPAAQVEQPVAPPPAPSPPPPEPVKVEPAVTEPPDLLGLNDPVPEATSNLDEKNSLALAIVPDADQKTSSAPSQVNGTTTTGWELALVTAPSSNENVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMVQQQPIHDPFFASSAVAAPHSVQMSAMANQQQAFMLHQHQHQQQMMMMAPPPQQSNPFGNPHGTNGHHYGPGMPVHASNPYAGLI